Interpretable deep learning: interpretation, interpretability, trustworthiness, and beyond

نویسندگان

چکیده

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to over-parameterized black-box nature, it is often difficult understand the prediction results deep models. In recent years, many interpretation tools proposed explain or reveal how models make decisions. this paper, we review line research try a comprehensive survey. Specifically, first introduce clarify two basic concepts—interpretations interpretability—that people usually get confused about. To address efforts in interpretations, elaborate designs number algorithms, from different perspectives, by proposing new taxonomy. Then, results, also survey performance metrics evaluating algorithms. Further, summarize current works models’ interpretability using “trustworthy” Finally, discuss connections between interpretations other factors, such as adversarial robustness several open-source libraries algorithms evaluation approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Interpretability and Generalisation for Deep Learning

متن کامل

Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

The lack of interpretability remains a key barrier to the adoption of deep models in many applications. In this work, we explicitly regularize deep models so human users might step through the process behind their predictions in little time. Specifically, we train deep time-series models so their classprobability predictions have high accuracy while being closely modeled by decision trees with ...

متن کامل

InterpNET: Neural Introspection for Interpretable Deep Learning

Humans are able to explain their reasoning. On the contrary, deep neural networks are not. This paper attempts to bridge this gap by introducing a new way to design interpretable neural networks for classification, inspired by physiological evidence of the human visual system’s inner-workings. This paper proposes a neural network design paradigm, termed InterpNET, which can be combined with any...

متن کامل

Interpretable Deep Learning applied to Plant Stress Phenotyping

Availability of an explainable deep learning model that can be applied to practical real world scenarios and in turn, can consistently, rapidly and accurately identify specific and minute traits in applicable fields of biological sciences, is scarce. Here we consider one such real world example viz., accurate identification, classification and quantification of biotic and abiotic stresses in cr...

متن کامل

A Deep Learning Interpretable Classifier for Diabetic Retinopathy Disease Grading

Deep neural network models have been proven to be very successful in image classification tasks, also for medical diagnosis, but their main concern is its lack of interpretability. They use to work as intuition machines with high statistical confidence but unable to give interpretable explanations about the reported results. The vast amount of parameters of these models make difficult to infer ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge and Information Systems

سال: 2022

ISSN: ['0219-3116', '0219-1377']

DOI: https://doi.org/10.1007/s10115-022-01756-8